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Abstract 

We consider the problem of optimal trading for a power producer in the context 
of intraday electricity markets. The aim is to minimize the imbalance cost induced by 
the random residual demand in electricity, i.e. the consumption from the clients minus 
the production from renewable energy. For a simple linear price impact model and a 
quadratic criterion, we explicitly obtain approximate optimal strategies in the intraday 
market and thermal power generation, and exhibit some remarkable properties of the 
trading rate. Furthermore, we study the case when there are jumps on the demand 
forecast and on the intraday price, typically due to error in the prediction of wind power 
generation. Finally, we solve the problem when taking into account delay constraints 
in thermal power production. 
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1 Introduction 


The development of renewable energy sources in Europe as a response to global climate 
change has led to an increase of exchange in the intraday electricity markets. For instance, 
the exchanged volume on the European Energy Exchange (EEX) for Germany has grown 
from 2 TWh in 2008 to 25 TWh in 2013. This increase is mainly due to the level of 
forecasting error of wind production, which leads power producers owning a large share of 
wind production to turn more than ever to intraday markets in order to adjust their position 
and avoid penalties for their imbalances. The accuracy of forecasts for renewable power 
production from wind and solar may vary considerably depending on the agreggation level 
(local vs regional forecast) and the time horizon. For a complete survey on this problem, the 
reader can consult Giebel et al. j5], and may have in mind that the root mean square error 
(RMSE) of the error forecast for the production of a wind farm in six hours can reach 20% of 
its installed capacity. Many different intraday markets have been designed and are subject 
to different sets of regulation. But, in all cases, intraday markets offer power producer the 
possibility to buy or sell power for the next (say) 9 hours to 32 hours (case of the French 
electricity market of EpexSpot). These trades can occur after the closing of the day-ahead 
market or during the clearing phase of the day-ahead market. 

The problem of trading management in the intraday electricity market for a balancing 
purpose has already drawn the attention in the literature. Henriot [6] studied the problem 
of how the intraday market can help a power producer to deal with the wind production 
error forecast in a stylized discrete time model. In his model, the power producer is a 
wind producer who is trying to minimize her sourcing cost on the intraday market while 
maintaining a balance position between her forecast production and her sales. Henriot’s 
model takes into account the impact of the wind power producer on the intraday price with 
a deterministic inverse demand function, and the intraday price is not a risk factor. The only 
risk factor comes from the error forecast of the wind production and its auto-correlation. 
Gamier and Madlener [4] studies the trade-off between entering into a deal in the intraday 
market right now and postponing it in a discrete time decision model where intraday prices 
follow a geometric Brownian model and wind production error forecast follows an arithmetic 
Brownian motion. In their framework, the power producer is supposed to have no impact 
on intraday prices. Liquidity risk is taken into account as a probability of not finding a 
counter-party at the next trading window. 

In this paper, we consider a power producer having at disposal some renewable energy 
sources (e.g. wind and solar), and thermal plants (e.g. coal, gas, oil, and nuclear sources), 
and who can buy/sell energy in the intraday markets. Her purpose is to minimize the 
imbalance cost, i.e. the cost induced by the difference between the demand of her clients 
minus the electricity produced and traded, plus the production and trading costs. In contrast 
with thermal power plants whose generation can be controlled, the power generated from 
renewable sources is subject to non controllable fluctuations or risks (wind speed, weather 
forecast) and is then considered here as a random factor just like the demand. We then call 
the residual demand the demand minus the energy generated by renewable energy. Thus, the 
problem of the power producer is to minimize the imbalance costs arising from her residual 
demand by relying both on her own controllable thermal assets and on the intraday market. 
As in [33, we assume that the power producer has access to a continuously updated forecast 
of the residual demand to be satisfied at terminal date T and that this forecast evolves 
randomly. Moreover, the intraday price for delivery at time T evolves also randomly and 
is correlated with the residual demand forecast. However, compared with [4], the intraday 
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market can be used for optimization purposes. We develop a model that allows us to study 
how power producers can take advantage of the interaction between the dynamics of the 
residual demand forecast and the dynamics of the intraday prices. 

Our model shares some links with optimal order execution problems, as introduced in 
the seminal paper by Almgren and Chriss j2], and then largely studied in the recent lit¬ 
erature, see e.g. the survey paper [9], In our context, the original feature with respect 
to this literature is the consideration of a random demand target and the possibility for 
the agent to use her thermal power production. This connection with optimal execution 
is fruitful in the sense that it allows us to take into account several features of intraday 
markets while maintaining the tractability of the model sufficiently high to allow analytical 
solutions. Hence, we take into account liquidity risk through a market impact, both perma¬ 
nent and temporary, on the electricity price generated by a power producer when trading 
in the intraday market. As in optimal execution problems, this impact is always in the 
adverse direction: when the producer sells, the price decreases and when she buys, the price 
increases. Our setting is a continuous-time decision problem representing the possibility for 
the producer to make a deal at each time she wants and not only at pre-specihed windows. 
Moreover, it is general enough as it permits us to study the limiting cases of a pure retailer 
(no production function), a pure trader (no demand commitment) and an integrated player 
(player owning both clients and generation), small or large. 

The main goal of this paper is to derive analytical results, which provide explicit solutions 
for the (approximate) optimal control, hence giving enlightening economic interpretations 
of the optimal trading strategies. In order to achieve such analytical tractability, we have 
to make some simplifying assumptions on the dynamics of the price process and of the 
residual demand forecast, as well as on the cost function, assumed to be of quadratic form 
meaning a simple linear growth of the marginal cost of production with respect to the 
production level. We first consider a simple model for a continuous price process with linear 
impact, and demand forecast driven by an arithmetic Brownian motion, and neglect in a 
first step the delay of production when using thermal power plants. We then study an 
auxiliary control problem by relaxing the nonnegativity constraint on the generation level, 
for which we are able to derive explicit solutions. The approximation error induced by this 
relaxation constraint is analyzed. In next steps, we consider more realistic situations and 
investigate two extensions: (i) On one hand, we incorporate the case where the residual 
demand forecast is subject to sudden changes, related to prediction error for wind or solar 
power production, which may be quite important due to the difficulties for estimating wind 
speed and forecasting weather, see £3J. This is formalized by jumps in the dynamics of the 
demand process, and consequently also on the price process. Again, we are able to obtain 
explicit solutions. Actually, the key tool in the derivation of all these analytical results is a 
suitable treatment of the linear-quadratic structure of our stochastic control problem, (ii) 
On the other hand, we introduce natural delay constraints in the production, and show how 
the optimal decision problem can be explicitly solved by a suitable reduction to a problem 
without delay. 

Our (approximate) optimal trading strategies present some remarkable properties. When 
the intraday price process is a martingale, the optimal trading rate inherits the martingale 
property, which implies in particular that the net position of electricity shares has a constant 
growth rate on average. Moreover, the optimal strategy consists in making at each time 
the forecast marginal cost equal to the forecast intraday price. This property follows the 
common sense of intraday traders. Consequently, if the producer has made sales or purchases 
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on the day-ahead such that her forecast marginal cost equals the day-ahead price and if the 
initial condition of the intraday price is the day-ahead price, thus, on average, the producer 
optimal trading rate is zero. This fact is no longer true when the demand forecast and the 
price follow processes with jumps. In this case, the optimal trading rate is a supermartingale 
or a subnrartingale depending on the relative probability and size of positive and negative 
jumps on the price process. For this reason, contrary to the case without jumps, the power 
producer may need to have a non-zero initial trading rate even if she has made sales or 
purchases on the day-ahead such that her forecast marginal cost equals the day-ahead price 
and if the initial condition of the intraday price is the day-ahead price. We also quantify 
explicitly the impact of delay in production on the trading strategies. When the price 
process is a martingale, the net inventory in electricity shares grows linearly on average, 
with a change of slope (which is smaller) at the time decision for the production. 

The outline of the paper is organized as follows. We formulate the optimal trading 
problem in Section 2. In Section 3, we study the optimal trading problem without delay. 
We first solve explicitly the auxiliary optimal execution problem, and then study the ap¬ 
proximation on the solution to the original problem, by focusing in particular on the error 
asymptotics. We illustrate our results with some numerical tests and simulations. We extend 
in Section 4 our results to the case where jumps in demand forecast may arise. In Section 
5, we show how the optimal trading problem with delay in production can be reduced to a 
problem without delay, and then leads to explicit solutions. Finally, the appendix collects 
the explicit derivations of our solutions, which are justified by verification theorems. 

2 Problem formulation 

We consider an agent on an intraday energy market, who is required to guarantee her 
equilibrium supply/demand for a given fixed time T: she has to satisfy the demand of her 
customers by purchase/sale of energy on the intraday market at time T and also by means 
of her thermal power generation. We denote by Xf the net position of sales/purchases of 
electricity at time t < T for a delivery at terminal time T, assumed to be described by 
an absolutely continuous trajectory up to time T, and by qt = Xf the trading rate: qt > 
0 means an instantaneous purchase of electricity, while qt < 0 represents an instantaneous 
sale at time t: 

X t = X 0 + f q s ds , 0 < t < T. (2.1) 

Jo 

Given the trading rate, the transactions occur with a market price impact: 

Pt(q) = Pt+ [ g(q s )ds + f(q t ). 

Jo 

Here, (Pt)t is the unaffected intraday electricity price process on a filtered probability space 
(HjJ 7 ,F = (•F"t)te[o,T])F > ); carrying some part of randomness of the market, and following 
the terminology in the seminal paper by Almgren and Chriss [2], the term f(qt) refers to 
the temporary price impact, while /q g(q s )ds describes the permanent price impact. The 
price (Pt)t may be seen as a forward price, evolving in real time, for delivery at time T. Let 
us then denote by Y the intraday electricity price impacted by the past trading rate q of 
the agent, defined by: 

Y t := Pt+ [ g(q s )ds. 

Jo 
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We assume that Yf is observable and quoted, which means actually that the agent is a large 
trader and electricity producer, whose actions directly impact the intraday electricity price. 
The case where the agent is a small producer can be also dealt with by simply considering 
a zero permanent impact function g = 0. Notice that the transacted price is equal to the 
sum of the quoted price Y and the temporary price impact: 

Pt(q) = Y t + f(qt). ( 2 . 2 ) 


The residual demand Dt is the consumption of clients of the agent minus the production 
from renewable energy at terminal date T, and we assume that the agent has access to a 
continuously updated forecast {Dt)t of the residual demand. The agent can use her thermal 
power production with a quantity £ at cost c(£) in order to match as close as possible the 
target demand Dt- In practice, generation of electricity can not be obtained instantaneously 
and needs a delay to reach a required level of production. Hence, the decision to produce 
a quantity £ should be taken at time T — h, where h E [0, T] is the delay. Thus, for a 
controlled trading rate q = (qt)t £ A, the set of real-valued E-adapted processes satisfying 
some integrability conditions to be precised later, a production quantity £ E L { +(pT-h) ■ the 
set of nonnegative Tt —^-measurable random variables, the total cost is: 


q t Pt{q)dt + C(D T -X T ,Z ) : = 


q t Pt(q)dt + c(£) + 1(D T -X T - £) 2 . (2.3) 


The first term in (2.3) represents the total running cost arising from the trading in the 


intraday electricity market, and the last term, where 77 > 0 , represents the quadratic pe¬ 
nalization when the net position in sales/purchases of electricity Xt + £ (including the 
production quantity £ at cost c(£)) at terminal date T does not fit the effective demand Dt- 
The objective of the agent is then to minimize over q and £ the expected total cost: 


minimize over q E A, £ E L°XpT-h 


E 


q t P t (q)dt + C(D T -X T ,0 . (2.4) 


Remark 2.1 1) The penalization term in the objective function above is a simplification of 
the effective penalization process that can be found in real electricity markets. For instance, 
the penalization of imbalances in the French electricity market depends both on the sign 
of the imbalance of the electricity system and on the price of imbalances (see |lj chap 2 ., 
sec. 2.2.1]). Nevertheless, the positive coefficient 7] captures the main objective of the 
penalization process. The agent has no incentive of being either too long or too short. 

2) On real markets, trading ends some time before the date of delivery, at which the agent 
has to ensure equilibrium (e.g. on the French electricity market, there is a delay of 45 
minutes). We do not include that practical fact in our framework, by considering that the 
delay is null for the sake of clarity. There is no mathematical consequence: it is enough to 
have in mind that the delivery and production do not really take place at T, but at T plus 
some delay. 

3) The larger is r], the stronger is the incentive for the agent to be as close as possible 
to the equilibrium supply-demand. At the limit, when 77 goes to infinity, the agent is 
formally constrained to fit supply and demand. However, the limiting problem when 7] = 
00 is not mathematically well-posed since such perfect equilibrium constraint is in general 
not achievable. Indeed, the demand at terminal date T is random, typically modelled via a 
Gaussian noise, and the inventory X which is of finite variation, may exceed or underperform 
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with positive probability the demand Dt at terminal date T. Hence, in the scenario where 
Xt > Dt, and since by nature the production quantity £ is nonnegative, it is not possible 
to realize the equilibrium Xt + £ = Dt, even if there is no delay. In the sequel, we fix r) > 


0 (which may be large, but finite), and study the stochastic control problem (2.4). 


4) The optimization problem (2.4) shares somes similarities with the optimal execution 


problem in limit order book studied in the seminal paper by Almgren and Chriss [2], and 
then extended by many authors in the literature, see e.g. the survey paper [9]. The main 
difference is that in the execution problem of equities, the target is to buy or sell a certain 
number of shares, i.e. lead Xt to a fixed constant (meaning formally that r] goes to infinity) 
while in our intraday electricity markets context, the target is to realize the equilibrium with 
the random demand Dt, eventually with the help of production leverage £. However, in 
contrast with the case of constant target, it is not possible in presence of random target Dt 
to achieve perfectly the equilibrium, which justifies the introduction of the penalty factor r] 
as pointed out above. □ 

The main aim of this paper is to provide explicit (or at least approximate explicit) 


solutions to the optimization problem (2.4), which are easily interpreted from an economic 


point of view, and also allow to measure the impact of the various parameters of the model. 
In order to achieve this goal, we shall adapt our modeling as close as possible to the linear- 
quadratic framework of stochastic control, and make the following assumptions: The energy 
production cost function is in the quadratic form: 

f \ P 2 
c{x) = -x , 

for some (5 > 0. Although simple, a quadratic cost function represents the increase of the 
marginal cost of production with the level of production. 


Remark 2.2 (Pure retailer) In the limiting case when (5 goes to infinity, meaning an infi¬ 
nite cost of production, this corresponds to the framework where the agent never uses the 
production leverage and only trades in the intraday-market by solving the optimal execution 
problem: 


minimize over q G A 


E 


q t P t (q)dt + C(D t - X T , 0 ) 


(2.5) 

□ 


As in Almgren and Chriss, we assume that the price impact (both permanent and 
temporary) is of linear form, i.e. 


g(q) = vq, f(q) = iq, 


for some constants v > 0 and 7 > 0. The unaffected intraday electricity price is taken as a 
Bachelier model: 


P t = Po + <?aW t , (2.6) 

where IT is a standard Brownian motion, and do > 0 is a positive constant. Such assumption 
might seem a shortcoming at first sight since it allows for negative values of the unaffected 
price. However, in practice, for our intraday execution problem within few hours, negative 
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prices occur only with negligible probability. The martingale assumption is also standard 
in the market impact literature since drift effects can often be ignored due to short trading 
horizon. The quoted price Y, impacted by the past trading rate q G A, is then governed by 
the dynamics: 


dY t = uq t dt + (T 0 dW t . (2.7) 

The dynamics of the residual demand forecast is given by 

dD t = pdt + (JddB t , (2-8) 


where p, are constants, with <7^ > 0, and B is a Brownian motion correlated with W: 
d <W,B > t = pdt, p G [—1,1]. 

From (2.2), one can then define the value function associated to the dynamic version of 

J(t,x,y,d\q,C) (2.9) 


the optimal execution problem (2.4) by: 


v(t,x,y,d ) := inf 

(Fr-h) 


with 


J(t,x,y,d-,q,0 := e[ / q s {Y^ + iqs )ds + C{D^ d - X^ x 


L Jt 


( 2 . 10 ) 


for (t, x, y,d ) G [0, T] X M x M x M, where At denotes the set of real-valued proce sses q = 
( Qs)t<s<T s.t. q s is Js-adapted and E[ J' f q^ds] < oo, D t,d is the solution to (2.8) starting 
from d at t, and given a control q G At, Y t,v denotes the solution to (2.7) starting from y 


at time t. and X t,x is the solution to (2.1) starting from x at t. 


In a first step, we shall consider the case when there is no delay in the production, and 
then show in the last section of this paper how to reduce the problem with delay to a no 
delay problem. We shall also study the case when there are jumps in the residual demand 
forecast. 


3 Optimal execution without delay in production 

In this section, we consider the case when there is no delay in production, i.e. h = 0. In this 


case, we notice that the optimization over q and £ in (2.4) is done separately. Indeed, the 
production quantity £ G LR_{Bt) is chosen at the final date T, after the decision over the 
trading rate process (qt)te[o,T\ I s achieved (leading to an inventory Xt)- It is determined 
optimally through the optimization a.s. at T of the terminal cost C(Dt — Xt,£), hence in 
feedback form by £^ = £ + (Dt — Xt) where 


£ + (d) := argminC^d,£) = argmin [—£ 2 + -(d — £) 2 ] 


«>o 

V 

7/ + /T 


£>o 


dt 


d> 0- 


The value function of problem (2.9) may then be rewritten as 


(3.1) 


v(t,x,y,d ) = inf E / q s (Yg’ y + jq s )ds + C + (D^ d - X^ x ) 2 

geAt L J t 


(3.2) 
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where 


C + {d) : = 


C{d,Z 

1 vP 

2 T) 


-(d)) 

pd 2 td>o + ^d 2 lrf<o- 


(3.3) 


and the optimal trading rate q* is derived by solving (3.2). 


Due to the indicator function in C + , caused by the non negativity constraint on the 


production quantity, there is no hope to get explicit solutions for the problem (3.2), i.e. solve 


explicitly the associated dynamic programming Hamilton-Jacobi-Bellman (HJB) equation. 
We shall then consider an auxiliary execution problem by relaxing the sign constraint on the 
production quantity, for which we are able to provide explicit solution. Next, we shall see 
how one can derive an approximate solution to the original problem in terms of this auxiliary 
explicit solution, and we evaluate the error and illustrate the quality of this approximation 
by numerical tests. 

3.1 Auxiliary optimal execution problem 

We consider the optimal execution problem with relaxation on the non negativity constraint 
of the production leverage, and thus introduce the auxiliary value function 

v{t,x,y,d) := inf J(t,x,y,d\q,£), 
q&A£&L°{F T ) 


for ( t,x,y,d) € [0,T] xKxMxR. By same arguments as for the derivation of (3.2), we 
have 


v(t, x, y, d) = inf E / q s {Xs ,v + lQs)ds + C(D^ d - X^ x ) 
geA L J t 


where 


(3.4) 


£(d) := argminC(d, £) = 


V + P 


C{d) := C(d,i(d)) = l^fpd 2 =: \r(r,,P)d 2 . 


(3.5) 


The function in (3.5) can be interpreted as a reduced cost function. Because the production 


cost function and the penalization are both quadratic, they can be reduced to a single 
production function where the imbalances are internalized by the producer. 


By exploiting the linear-quadratic structure of the stochastic control problem (3.4), we 
can obtain explicit solutions for this auxiliary problem. 
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Theorem 3.1 The value function to ( |3.4| ) is explicitly equal to: 

r-t) + 7) 


v(t,x,y,d) = 


(r(v, /3) + u)(T - t) + 2 7 
T-t 


[(d, — x) 2 + 2 fj,(T — t)(d — x)) 


(r<„, „) + „) ( r - D ^ 1 (- T + r( ’'- mT - t)v) 


r(r},P)(T -t) 


-{d- x)y 


( r ( 7 7 ) P) + v)(T — t) + 27 

^0 + ^(q, ff) ~ 2 pcr 0 o- d r(r ? , ft) / (r(ry, ft) + t/)(T - t) 
(r(r),/3) + v) 2 ' 2 7 

^d r (.V, P)v + 2pa 0 a d r(ri, p ) - <rg _ 


+ 


+ 


2(r(r ? ,/3) + zy) 

r(7,ftV(T-t) 2 (§(T-f) + 7 ) 


(r(? 7 ,^)+ i/)(T-t)+ 27 
for (f, x, y,d ) 6 [0, T] xKxKx M, with an optimal trading rate given in feedback form by: 

•}t,d \rt,x,y,d -y-t,x,y,d} 

L s 

r{v,P)(pt + d)-y 


q s = q(T - s,Dl’ d - Xl’ x ’ y ’ d ,Y*’ x ’ y ’ d ), t<s<T 


q{t,d,y) : = 


( r (r), P) + v)t + 2 7 ’ 


(3.6) 


Here ( y X t ' x,v,d , Y t,x,y,d , D t,d ) denotes the solution to (2.1)-(2.7)-(2.8) when using the feedback 
control q, and starting from (x,y,d) at time t. Finally, the optimal production leverage is 
given by: 


& = i(D f p d - 4 


V 


V + P 


(D t /-X t ™’ d ). 


(3.7) 


Skech of proof. We look for a candidate solution to (3.4) in the quadratic form: 

w(t,x,y,d ) = A(T — t)(d — x) 2 + B(T — t)y 2 + F(T — t)(d — x)y 

+ G(T -t)(d-x) + H(T - t)y + I<(T - t), 

for some deterministic functions A. B , F. G, H and K. Plugging this ansatz into the 
Hamilton-Jacobi-Bellman (HJB) equation associated to the stochastic control problem (3.4), 
we find that these deterministic functions should satisfy a system of Riccati equations, which 
can be explicitly solved. Then, by a classical verification argument, we check that this ansatz 
w is indeed equal to v, with an optimal feedback control derived from the argmax in the 
HJB equation. The details of the proof are reported in Appendix. □ 


Remark 3.1 (Pure trader) By sending P to infinity in the expression of the value function 
v and of the optimal feedback control q, and observing that r(r], (3) goes to 77 , we obtain the 
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solution to the optimal execution problem (2.5) without leverage production: 


v NP (t,x,y,d ) := inf E / q s (Y*’ y + ~/q s )ds + C(D^ d - X^ x , 0) 

q eA L J t 


?? (|(T-t)+ 7 ) 

(rj + u)(T - t) + 2 7 
T-t 

+ 

+ 


+ 7 


(r/ + ^)(T —t) + 27 

v(T-t) 

(V + v)(T - t) + 27 
+ ^J 7 ? 2 - 2pa 0 a d r/ 


((d — x ) 2 + 2p(T — t)(d — x)) 
( - y + W(T - t)y) 

(d — x)y 

(v + v)(T — t) 


+ 


+ 


(v + v) 

a 2 d yv + 2pa 0 a d r] - 


In 1 + 


2 7 


(T-t) 


2(7 + v) 
r ?M 2 (T-t) 2 (|(T-t)+ 7 ) 


(3.8) 


(rj + v) (T-t) + 2 7 

for (f, x, y,d) E [0, T] xlxlx M, with an optimal trading rate given in feedback form by: 

€ 

q NP (t,d,y) : = 


= q- p (T - s,Di’ d - X t s ’ x ’ y ' d 1 Y*’ x ’ y ’ d ), t < s <T 
+ d) -y 


(r? + v)t + 2 y‘ 


□ 


Interpretation: 


1. The optimal trading rate q s at time s E [t, T], given in feedback form by (3.6), is 
decomposed in two terms: the first one 


r(y,/3) 


(r(ri, P) + v){T - t) + 27 


(/i(T - s) + 


is related to the trading rate in order to follow the trend of the demand, and to the 
incentive to invest when the forecast of the residual demand is larger than the current 
inventory. The second term 

_^_ -Crt,x,y,d 

(r(rt,P) + v){T-t) + 27 s 

represents the negative impact of the quoted price on the investment strategy: the 
higher the price is, the more the agent decreases her trading rate until she reaches 
negative value meaning a resale of electricity shares. These effects are weighted by 
the constant denominator term depending on the penalty factor 7 , the marginal cost 
production factor /?, the temporary and permanent price impact parameters 7 , v, and 
the time to maturity T — t. 
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2. By introducing the marginal cost function: d (x) = fix, and the process 


t ■■= -^-s(D t ’ d + ^T-s)-X t s ’ x ^ d -q s (T-s)), t<s<T, 

r? + p x 

which is interpreted as the forecast production for the final time T (recall expression 
of the final production), we notice from the expression of the optimal trading 
rate that the following relation holds: 

Y s t ’ x ’ y ’ d + vq s (T-s) + 2 1 q s = c'(| s ), t<s<T. (3.9) 

This relation means that at each time, the optimal trading rate is to make the forecast 
intraday price plus marginal temporary impact (left hand side), which can be seen as 
the marginal cost of electricity on the intraday market at time T, equal to the forecast 
marginal cost of production. Here, the instantaneous impact 7 appears as a marginal 
cost of buying or selling, and the forecast at time s supposes that the optimal trading 
rate q s is held constant between s and T. □ 


(3. 


We complete the description of the optimal trading rate by pointing out a remarkable 
martingale property. 


Proposition 3.1 The optimal trading rate process ( q s )t<s<T hi (3.6) is a martingale. 

Proof. By applying Ito’s formula to q s = q(T — s, D 1 / 1 — 
since q is linear in d and y, we have: 


V 

1 1 S 


, t < s < T, and 


dq s = [ - + (/U - q)|| + (IT - S , 


dt 

dq 


+ ^{T-s, - X$*>v> d , Y*’ x ’ y ’ d )cr d dB 8 

+ tt~(T — s, DY - Xl’ x ’ y ’ d , Y*’ x ’ y ’ d )a 0 dW s , 
dy 


from the dynamics (2.1), (2.8), and (2.7) of X t ' x,y,d , D t,d and Y t,x,y,d . Now, from the explicit 

,dq 


expression of the function q(t, y, d), we see that 


d Q,( 7 ^, - 

-m + u ‘- 9) dd + , ' q dy 


= 0 , 


and so: 


dq s = 


r{ri,(3)v d 


(r(ri,P) + i/)(T-s) + 27 
which shows the required martingale property. 


dB, - 


co 


(r(ri,P) + i/)(T-s) + 27 


dW 8 


(3.10) 

□ 


Remark 3.2 Recall that in the classical optimal execution problem as studied in |2], the 
optimal trading rate is constant. We retrieve this result in their framework which corres¬ 
ponds to the case where a d = 0 (constant demand target), /? = 00 (there is no production), 
and rj = 00 (constraint to lead Xt to the fixed target), see Remark 2.1 4). Indeed, in 
these limiting regimes, we see from (3.10) that dq s = 0, meaning that {q s ,t < s < T} is 
constant. In our framework, this is generalized to the martingale property of the optimal 
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trading rate process, which implies that the optimal inventory {X 

m[x t s ’ x ’ y ’ d - 


t.x.y.d 
s 5 


constant growth rate in mean, i.e. 
time t given by q(T — t,d — x,y). 


ds 


t < s < T} has a 
is constant equal to the initial trading rate at 


As a consequence of this martingale property, if the producer already satisfies the relation 


(3.9) in the day-ahead market, and if the initial intraday price is the day-ahead price, her 


initial trading rate on the intraday market will be zero. And thus, on average, her trading 
rate will be zero. 

The martingale property of the trading rate proce ss is actually closely related to the 
martingale dynamics of the unaffected price P in (2.6). As we shall see in Section [d] where 


we consider jumps on price, making P a sub or super martingale, the optimal trading rate 
will inherit the converse sub or super martingale property. □ 


3.2 Approximate solution 

We go back to the original execution problem with the non negativity constraint on the 
production quantity. As pointed out above, there is no explicit solution in this case, due 
to the form of the terminal cost function C + . The strategy is then to use the explicit 


control consisting in the trading rate q derived in (3.6), and of the truncated nonnegative 
production quantity: 


:= 


€t> o 


= )> 


(3.11) 


with defined in (3.7) from the auxiliary problem. In other words, we follow the trading 


rate strategy q determined from the problem without constraint on the final production 
quantity, and at the terminal date use the production leverage if the final inventory x!jf' ,y ’ d 
is below the terminal demand D^ d , by choosing a quantity proportional to this spread 
D^ d — X^ x,y,d . The aim of this section is to measure the relevance of this approximate 
strategy (g, ££) E A X with respect to the optimal execution problem (2.9) by 

estimating the induced error: 

£i{t,x,y,d) := J(t,x,y,d\q,lj) - v(t,x,y,d), 

for (t, x, y,d) E [0, T] xRxlx M. We also measure the approximation error on the value 
functions: 


S 2 (t,x,y,d) := v(t,x,y, d) — v(t, x, y, d). 

Notice that if > 0 a.s., i.e. Dj, d > x^ x ’ y ’ d a.s. (which is not true), and so ^ = £r, 
then clearly (q, £t) would be the solution to (2.9), and so E\(t,x,y,d) = £ 2 {t,x,y,d) = 0. 
Actually, these errors depend on the probability of the event: {Xj. x,y,d > Dj?}, and we 
have the following estimate: 


Proposition 3.2 For all ( t,x,y,d ) E [0, T] x M x M x M, we have 

'm{T — t, d — x,y) 


0 < f.V.x.M) < ^V(T- 


s/V(T-t) 


), * = 1,2, (3.12) 


where 


:= (z 2 + 1 ) 3 >(— z) — zcj>(z), z£ 
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(3.13) 


with 4> = <h / the density of the standard normal distribution, and 

(ut + 2 y )(pt + d) + yt 


m(t, d, y) := 


V(t) : = 


(riji, p) + v)t + 27 


rt <TqS 2 + a 2 d (vs + 27) 2 + 2pa 0 a d s(vs + 27 ) 
[(r(r/,^) + z/)s + 2 7 ] 2 


ds > 0 . 


(3.14) 


Proof. By definition of the value functions v and v, recalling that (q,£t) is an optimal 
control for v, and since (q,^) G A x L°+(Ft), we have: 

J(t,x,y,d;q,£ T ) = v(t,x,y,d) < v{t,x,y,d ) < J(f, x, y, d; g, &), 

for all (t,x, y,d) G [0,T]xKxRx M. This clearly implies that both errors £1 and £2 are 
nonnegative, and 

max(£i(f,x,y,d),£ 2 (f,a;,y,d)) < £(t,x,y,d ) := J(f, x, y, d; q, &) - J{t, x, y, d; q, £r)- 


We now focus on the upper bound for £. By definition of J in (2.10), and in (3.7) 
and (3.11), we have 


£(t, x, y, d) = E C(D^ d - Ct) - C'(^>T a - Xj, x ’ y ’ a , £r) 


t,x,y,d - 


-\t,d 


= E 


C(£»^ - £ + (£>r a - 


'rt,x,y,d 


- C{D% d - X z ™' a , {(D 1 * - X l ™' a )) 


t,x,y,d 

t,d Y^t,x,y,d^ 
t,x,y,d £/ r-\t,d Y~t,x,y,d 


= E 


C + {D t / ~ X^ x,y ’ d ) - C(Dj, d - X% e ’ y ' a ) 


rt,x,y,d\ 


yr{r),p) 

2/3 


E 


(D t /-X% B,v,d ) 2 * D t. 


d _x% :c ’y'd < o 


(3.15) 


from the definitions and expressions of C + and C in (2.3), (3.3) and (3.5). Now, from (3.10) 

licit (path- 

r(v,/3)cr d 


and by integration, we obtain the explicit (path-dependent) form of the optimal trading 
rate control: 


= qt + 


s; 

1 


( r iv,P) + v)(T-u) + 27 
_ 07 _ 

t (r(r},P) + v)(T-u) + 2 y 


dB u 

dW u , t < s < T, 


with qt = q(T — t,d — x,y). We then obtain the expression of the hnal spread between 
demand and inventory: 


j-\t,d -£rt, x ,y,d 

JLArj-1 y\. r~p 


= d - x + p(T -t)+[ a d dB s - [ q s ds 


't J t 

rT 


°d(v(T - s) + 27 ) 
t (r(ri,P) + v)(T-s) + 2j 
T a 0 (T-s) 


= m(T — t,d — x,y) + 

+ 


dB, 


It (r(rj,P) + v)(T-s)+ 2 y 

by Fubini’s theorem, and with 

m(t,d,y ) := d + pt — tq(t,d,y), 


dW„ 
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which is explicitly written as in (3.13) from the expression (3.6) of q. Thus, D^ — X^ x,y ’ d 
follows a normal distribution law with mean m(T — t, d — x,y) and variance V(T — t) given 

'm(T — t,d — x,y)' 


by (3.14), and from (3.15), we deduce that 


r,. n vr(r],P) 

£{t,x,y,d ) = ——— V(T — 


y/V(T-t) 


while the probability that the final inventory is larger than the terminal demand is: 

f [4'-< o] = *( MT ; v t ^ s) ) 


(3.16) 

□ 


Error asymptotics. We now investigate the accuracy of the upper bound in (3.12) 

"m(T — t,d — x,y)' 


£(T — t,d — x,y) TMLv<? - 


It is well-known (see e.g. Section 14.8 in [10j ) that 

z4>(— z) < 4>(z), \/z G 


y/V(T-t) 


(3.17) 


from which we easily see that ijj is non increasing, convex, and ip^oo) = 0. Thus, £(T — 
t,d — x, y ) decreases to zero for large m(T — t,d — x;y) or small V(T — t). We shall study 
its asymptotics in three limiting cases (i) the time to maturity T — t is small, (ii) the initial 
demand spread d — x is large, (iii) the initial quoted price y is large. We prove that the 
error bound £{T — t,d — x, y), and thus £\(t, x, y, d), £z{t, x, y, d), converge to zero at least 
with an exponential rate of convergence in these limiting regimes: 


Proposition 3.3 (i) For all ( x,y,d ) GMxKxK with d > x, we have 

limsup(T — t) Yn£(T — t, d — x, y) < —( - ^ 

T-t \o 2 V a d 


(3.18) 


(ii) For all (t, y) G [0, T) x M, we have 

1 


lim sup 

d—x—> oo 


(d — x ) 2 


In £(T — t,d — x,y) < — - 


1 mUT-t) 


2 V(T — t) ’ 


where 


mooit) = 


vt + 2 y 


(r(rj, (3) + v)t + 2 q 


(iii) For all ( t , x, d) G [0,T)xRx M, we have 


where 


1 — 1 n? (T — 

lim sup —2 Inf (T — t,d — x,y) < ——, 

y^o O y 2 V{T — t) 


noc(t) = 


(r(ri,l3) + u)t + 27 ' 


(3.19) 


(3.20) 
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Proof. From (3.17), we have: 


0 < 'ip(z) < z <f>(z), Vz > 0. 

Notice that in the three asymptotic regimes (i) (with d—x > 0), (ii), and (iii), the quantity 
m(T — t,d — x,y) is positive, and we thus have: 


£(T — t,d — x,y) < 


r)r(r),(3) V(T—t) 2 
2(3 m(T — t,d — x,y) 


m(T — t,d — x,y) 
y/V(T — t) 


(3.21) 


(i) For small time to maturity T — t, we see that m(T — t, d — x,y) converges to d — x > 0, 
while V(T — t) ~ a^(T—t), i.e. V(T — t)/a^(T — t) converges to 1. This shows from (3.21) 
that, when T — t goes to zero, the error bound £(T — t,d — x, y ), converges to zero at least 
with an exponential rate of convergence, namely the one given by ( |3. 18 ). 

(ii) For large demand spread d — x, we see that m[T — t,d — x,y) ~ — t)(d — x), i.e. 

the ratio m(T — t,d — x, y)/m 00 {T — t)(d — x) converges to 1 when d — x goes to infinity. 


This shows from (3.21) that, when d — x goes to infinity, the error bound £(T — t,d — x, y), 
converges to zero at least with an exponential rate of convergence, namely the one given by 


(3.19). 


(iii) For large y, we see that m(T — t, d — x,y) ~ n^T — t)y, i.e. the ratio m(T — t,d — 


x, y)/noo(T — t)y converges to 1 when y goes to infinity. This shows from (3.21) that, when 
d — x goes to infinity, the error bound £(T — t, d — x,y) converges to zero at least with an 
exponential rate of convergence, namely the one given by (3.20). □ 


Interpretation. Recall from (3.16) that 


p = «>(- ! 5 ^=p 1 ) 


and thus following the same arguments as in the above proof, we have: 

(i) 

limsup(T — t) kiPT-Dj^ < X t J c,y ’ dl \ = — ( - -V , 

T -40 2\ ad > 


(3.22) 


for all (x,y,d) GRxKxR with d > x. We observe that the rate in the rhs of (3.18) 


(or (3.22)) depends only on the demand volatility ad and the initial demand spread 
d — x. Moreover, it is all the larger, the smaller cr^ is, and the larger d — x is. This 
means that the terminal demand will stay with very high probability above the final 
inventory once we are near from the maturity with a low volatile demand, initially 
larger than the inventory, in which case, the explicit strategy (< 7 , ££) approximates 
very accurately the optimal strategy (^ q *,££). 


lim sup — 

d—X^OG 


1 


- X) 


jlnP [D^ d < X^ y ’ d ] 


1 ml(T-t) 

2 V(T — t) '' 


(3.23) 


for all (t,y) G [0, T) X M, The rate in the rhs of (3.19) (or (3.23)) is all the larger, the 
smaller the volatilities a$ and of the electricity price and demand are. Again, we 
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have the same interpretation than in the asymptotic regime (i), and this means that 
the explicit strategy (q, ££) approximates very accurately the optimal strategy (q*,^ t) 
in the limiting regime when the initial demand spread is large, and the volatilities are 
small. 


(hi) 

limsup \ In P [D^ d < X^ x ’ y,d ] = ^ , (3.24) 

y -> oo y z 2 V(T — t) 

for all [t, x, d) £ [0,T)xKx M. In the limiting regime where the initial quoted price y 
is large, the agent has a strong incentive to sell energy on the intraday market, which 
leads to a final inventory staying under the final demand with high probability, and 
thus to a very accurate approximate strategy (q, £^). As in case (ii), this accuracy is 
strengthened for small volatilities do and a d of the electricity price and demand. □ 

3.3 Numerical results 
3.3.1 Numerical tests 

We measure quantitatively the accuracy of the error bound derived in the previous paragraph 
with some numerical tests. Let us fix the following parameter values: (Jo = 1/60 €-(MW ) _1 • 
s” 1 / 2 , a d = 1000/60 MW-s -1 / 2 , /3 = 0.002 €-(MW)“ 2 , r/ = 200 €-(MW)" 2 , /i = 0 MW-P 1 , 
v = 10~ 10 €-(MW) -2 , 7 = 10 _ 10 €-s-(MW ) _2 and p = 0.8. 

We start from the initial time t = 0, with a zero inventory Xq = 0, and vary respectively 
the maturity T, the initial demand Dq and the initial price lo. We compute the probability 
for the final inventory to exceed the final demand P [Xt > Dt\, the approximate value 
function h(0, Xq, Yq, Dq), and the error bound £ (T, Dq — Xq, lo). The results are reported 
in Table [l] when varying T, in Table [2] when varying Dq and in Table [3] when varying Yq. 


T(h) 

PfA/r > Df] 

v(0,Xq,Yq,Dq) (€) 

£(T,Dq-Xq,Yq) (€) 

1 

< 10~ 16 

1.88 x 10 6 

< lO ” 16 

8 

< 10“ 16 

1.88 x 10 6 

< 10“ 16 

24 

< 10" 16 

1.89 x 10 6 

4.16 x 10 -12 

50 

7.72 x 10 ~ 13 

1.90 x 10 6 

2.48 x 10 “ 4 


Table 1: Yq = 50 €-(MW) 1 and Dq = 50000 MW 


Table 1 shows that for time to maturity less than T = 24h, the probability for the final 
inventory to exceed the final demand is very small, and consequently the error bound is 
rather negligible. When the time horizon increases, the agent has the possibility to spread 
over time her trading strategies for reducing the price impact, and purchase more energy, 
in which case the probability for the final inventory to exceed the demand increases. 

Table 2 shows that the probability for the final inventory to exceed the final demand, 
and the error bound are not much sensitive to the variations of the initial positive demand 
Dq. Actually, the main impact is caused by the initial stock price, as observed in Table 3. 

For small initial electricity price Yq, the agent will buy more energy in the intraday 
market and produce less. Therefore, the inventory will overtake with higher probability the 
demand, in which case the approximate value function can be significantly different from 
the original one, as observed from the error bound in Table 3 for Yq = 20. 
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D 0 (MW) 

IP [At 1 > Dt] 

v(0,X 0l Y 0 ,Do) (€) 

£{T,D 0 -X 0 ,Y 0 ) (€) 

500 

< 10" 16 

-5.86 x 10 5 

4.16 x 10~ 12 

5000 

< 10" 16 

-3.62 x 10 5 

4.16 x 10' 12 

50000 

< 10 -16 

1.89 x 10 6 

4.16 x 10“ 12 

500000 

< 10“ 16 

2.44 x 10 7 

4.16 x 10~ 12 


Table 2: T = 24 h and Y 0 = 50 C^MW)' 1 


Y 0 (€-(MW) 1 ) 

PfAj 1 > Dt] 

h(0,Ao,Y 0 ,D 0 )(€) 

£{T,D 0 -X 0 ,Y 0 ) (€) 

500 

< 10“ 1(j 

2.51 x 10 s 

< 10“ ie 

50 

< 10" 16 

1.89 x 10 6 

4.16 x 10~ 12 

40 

9.51 x 10" 15 

1.61 x 10 6 

3.80 X 10" 4 

30 

4.57 x 10 -10 

1.29 x 10 6 

1.30 X 10 -2 

20 

2.23 x 10“ 5 

9.13 x 10 5 

1.26 x 10 3 


Table 3: T = 24 h and D 0 = 50000 MW 


3.3.2 Simulations 

We plot trajectories of some relevant quantities that we simulate with the following set of pa¬ 
rameters: do = 1/60 €'(MW)“ 1 -s' 1 / 2 , ad = 1000/60 MW-s -1 / 2 , /3 = 0.002 €-(MW)~ 2 , r/ = 
100 €-(MW)“ 2 , p = 0 MWT 1 , p = 0.8, v = 4.00 X 10- 5 €-(MW)“ 2 , 7 = 2.22€-s-(MW)~ 2 , 
T = 24h, X Q = 0, D 0 = 50000 MW and Y 0 = 50 e(MW) _1 . 

For such parameter values, the probability P[Xt > Dt\ is bounded above by 10 -16 , the 
error £{0, Dq — Xq. Yq) is bounded by 2.82 x 10~ lo €, and 


v(0,X o ,Y 0 ,D o ) = 1916700€. 


The executed strategy (q, ££,) can then be considered as very close to the optimal strategy. 


Figure [I] represents the evolution of the trading rate control (qt)te[o,T\ derived in (3.6) for 


a given trajectory of pric e an d demand, and this is consistent with the martingale property 
as shown in Proposition |3.l[ Figure [2] represents a simulation of the quoted price Yj with 
impact and of the unaffected price Pt■ Due to the buying strategy, i.e. positive q, we 
observe that the quoted price Y is larger than P. In Figure |3j we plot the evolution of the 
optimal inventory (Xt)t.&[ o,T), an< ^ °f the forecast residual demand (Th) tg [o 77 We see that 
Xt is increasing, with a growth rate which looks constant as pointed out in Remark 3.2 At 


final time, if Xt < Pt (which is the case in our simulation), the agent uses her production 
leverage an< 4 achieves a fi nal i nventory: Xt + /r ; which is represented by the peak at 
time T. From the expression (3.7) of the final imbalance cost is equal to 


Dt — Xt — = 


and is then positive, as shown in Figure [3j 




ri + P 


{Dt — Xt), 
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(MW) Price (euro/MW) Rate of buying/selling (MW 



Figure 1: Evolution of the trading rate control q 



Figure 2: Simulation of the quoted impacted price Y and of the unaffected price P. 



Figure 3: Evolution of the inventory X and of the forecast residual demand D. 
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4 Jumps in the residual demand forecast 


In this section, we incorporate the case where the residual demand forecast is subject to 
sudden changes induced by prediction errors on renewable production, which may be quite 
large. Our aim is to study the impact on the strategies obtained in the previous section, 
and we shall also neglect the delay in thermal plants production. 

The sudden changes in the demand forecast are modeled via a compound Poisson process 
Nt = with intensity A > 0, where lV t + is the counting process associated to 

positive jumps of the demand forecast with size (5 + > 0, occurring with probability p + E 
[0,1], while Nf is the counting process associated to negative jumps of the demand forecast 
with size 5~ < 0, occurring with probability p~ = 1 —p + . We denote by 6 := 5 + p + + 5~p~ 
the mean of the jump size of the demand forecast. The dynamics of the residual demand 
forecast D is then given by: 


dD t = pdt + addB t + S + dN^~ + S dN t 


(4.1) 


where we add a jump component with respect to the model in (2.8). Moreover, as soon as 


a jump in the residual demand forecast occurs, this is impacted into the intraday electricity 
price since the main producers are assumed to have access to the whole updated forecast. 
We thus model the unaffected electricity price by: 


Pt = P 0 + a 0 W t + TT+N+ +7T-W 


t ? 


(4.2) 


where we add with respect to the Bachelier model in (2.6) a jump component of size 7r + 
> 0 (resp. n~ < 0) when the jump on residual demand is positive (resp. negative), which 
means that a higher (resp. lower) demand induces an increase (resp. drop) of price. We 
denote by 7r := TT + p + + TT~p~ the mean of the jump size of the intraday price. Given a 
trading rate q E A, the dynamics of the quoted price Y is then governed by 


dY t = uq t dt + (Tq dWt + ir + dN^~ + 7r dN t . (4-3) 


By considering this simplified modeling of demand forecast subject to sudden shift in 
terms of a Poisson process, we do not have additional state variables with respect to the 
no jump case of the previous section. Let us then denote by v = v^\t,x,y,d) the value 
function to the optimal execution problem (2.4) with cost functional J = x, y, d, q, £), 

where we stress the dependence in A for taking into account jumps in demand forecast. The 
value function in the no jump case derived in the previous section is denoted by v = v^°\ 
As in the case with no jumps, there is no explicit solution to v^ due to the non negativity 
constraint on the final production: we shall first study the auxiliary execution problem 
without sign constraint on the final production, then provide an approximate solution to 
the original one with an estimation of the induced error approximation, and with some 
numerical illustrations. We compare the results with the no jump case by focusing on the 
impact of the jump components. 


4.1 Auxiliary optimal execution problem 


Similarly as in Subsection |3.1[ we consider the optimal execution problem without non 
negativity constrai nt on the final production, denoted by v = v^Y(t,x,y,d). 

As in Theorem 3.1 for the case of the value function without jumps, we have an 
explicit solution to this auxiliary problem. 
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Theorem 4.1 The value function to the auxiliary optimization problem is explicitly given 
by: 


v {X \t,x,y,d ) 

= v {0 \t,x,y,d) 

A r(r], (3)(T — t)(jt(T — t) + 25{v(T — t) + 2 7 )) 

2 (r(r,,P) + v)(T-t) + 2 7 ^ 

A (T -t) 2 (n -2r(r],/3)5) 

2 (r(r ? ,/?) + zz)(T-t) + 2 7 2/ 

p + (7r + — r(rj, (3)5 + ) 2 + p~(tt~ — r(r), /3)5~) 2 ( (r(rj, /3) + v){T - t) 

+A 7 -;- to -hi H--- 

(r(ri,/3) + v) ' 2 7 


+ 


\p + ((ir + ) 2 — r(rj, j3)5 + (2-n + + v5 + )) + p~ ((tt - ) 2 — r(ry, /3)S~(2tt^ + zM - )) 
2 r(rj, fi) + v 

\r(rji (3) 2 up5 + A((p + ) 2 < 5 + ( 7 r + + ud + ) + {p~) 2 5~ (ir~ + u5~)) 2 


(T-t) 


r(rj,/3) + v 

2 r(ry, /3)<5 2 + 2zzp+p - <5+<5 - — ((p + ) 2 5 + 7 r + + (p _ ) 2 ( 5 _ 7 r _ ) 2 


+A 7r , (77,/3) 
+ 


(r( 77 , /3) + i/) ((r(ry, /3) + i/)(T - t) + 2 7 ) 
2A 7 r 2 ( 7 , /3)fj,5 


(T-ty 
(T-ty 


_ (T - t) 2 - ^ n (T - t) 3 

(r(ri,P) + i/)((r(r},P) + i/)(T-t)+ 2 7 ) V 48 7 V 


A 2 p+p r( 7 ,/3) 2z/£+<5 + (5 tt + + <5+tt _ 3 

2 ( r (n,P) + v)(T-t) + 2 7 l j 

1 4r( 7 , [3)p\7r — A 2 7 t 2 _ 3 

+ 8(r(r ? ,/3) + zz)(T-t) + 2 7 l j 1 


for (t, x, y,d ) E [0, T] xlxlx M, with an optimal trading rate given in feedback form by: 

qW = qW(T- S ,D t s ’ d -X t s ’ x ’ y ’ d ,Yt’ x ’ y ’ d ), t<s<T 

4 A-)/ , \ Af 0 w. , v r(rj,p)dt+ §}(r(ri,P) + v)t 2 

q y Ut,d,y ) := q °Ht, d, y) + A-———A—- 

q 1 v (r( 7 , /3) + v)t + 2 7 


= g (0) (t, d + X5t, y + ^irt) + ^-t, 


(4.4) 


where q is the optimal trading rate given in (3.6) in the ca se w it h no j ump in the demand 
forecast. Here Y t ' x,y ' d , D t,d ) denotes the solution to (2.1)-(4.3)-(4.1) when using the 

feedback control q^ x \ and starting from ( x,y,d ) at time t. Finally, the optimal production 
quantity is given by: 


& A) = -JV—fDp-xF’V’*). ( 4 . 5 ) 

r) + p y ’ 

Proof. See Appendix. □ 

Interpretation. The expression of the optimal trading rate qi X \ s E [t, T], as 

7A) _ 70) r(r 1 ,f3)5(T-s) + ^(r(r l ,P) + v)(T-s) 2 

Qs Qs + (r(r,,f3) + u)(T-s) + 2 7 ’ 
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where q^ = q(°\T — s, Dt’ d — Xs’ x ’ y,d , Ys’ x,y,d ) represents the optimal trading rate that the 
agent would use if she believes that the demand forecast will not jump, shows that under 
the information knowledge about jumps, the agent will purchase more (resp. less) electricity 
shares and this impact is all the larger, the larger the intensity A of jumps, and the positive 
(resp. negative) mean 6 and it of jump size in demand forecast and price are. On the other 
hand, the expression of as the sum of two terms: 

qW = q^(T-s,D t / + \S(T-s),Y s t ’ X ’ V ’ d +^(T-s)) + ^(T - s), (4.6) 


can be interpreted as follows. The first term is analog to the optimal trading rate in the no 
jump case, with an adjustment A 5(T — s ) in the demand, which represents the expectation 
of the demand jump size up to the final horizon, and an adjustment ^ 7 r(T — s) on the 
price, which represents half of the expectation of the price jump size up to the final horizon. 
The second term, ^(T — s), is deterministic, and linear in time, and we shall see on the 
simulations for some parameter values that it can be dominant with respect to the first 
stochastic term. Moreover, notice that the equilibrium relation (3.9) in the no jump case 
between forecast intraday price and forecast marginal cost of production does not hold 
anymore in the presence of jumps, except at terminal date T: 


Yt,x,y,d 

1 T 


+ 27 q ( T ] 




(4.7) 


□ 


The unaffected price P in (4.2) is no more a martingale in presence of jumps, except 
when 7r = 0. It is actually a supermartingale when 7r < 0 (predominant negative jumps), 
and submartingale when 7r > 0 (predominant positive jumps). The next result shows that 
the optimal trading rate inherits the converse submartingale or supermartingale property 
of the price process. 


Proposition 4.1 The optimal trading rate process (qs^)t<s<T in (4.4) is a supermartingale 
if 7r > 0, and a submartingale if 7r < 0. More precisely, the process {qi^ + |^(s — t),t < 
s < T} is a martingale. 


Proof. Notice that is a Poisson process with intensity A ,p±, and let us introduce the 
compensated martingale Poisson process Nj~ = Nf — A p±t. By applying Ito’s formula to 
the trading rate process qi^ = gi A i(T — s, D 1 / 1 — Yg’ x ’ v,d ), t < s <T, and from the 
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dynamics (2.1), ( |4.1[ ) and (4.3), we have: 


dqW = 


+ 


+ 


+ A p + (q {X) (.,. + 5 + ,. + vr+) - g (A) ) 

+ A p~ (g (A) (., . + 5-,. + ir~)- gW)l (T - s, DY - jt*™’ 4 , Y*' x ’ y ' d )ds 

dqW 


dd -(T~ ^ D s ~ Xl’ x ’ y ’ d , Yg ,x,y,<1 )a d dB s 


dq™ 

dy 


(T — s, D^ d — X 


rt,x,y,d yt,x,y, d 


)a 0 dW s 


+ [g (A) (T - s, D + 5 + - Xl’ x ' y ’ d , Y^ x ’ y ’ d + vr+) 

- q w (T - s, D^ d - X^’V’ 4 , Y*L x ’ y ’ d )] dN+ 
+ [gW(T - s, D^ d + 5~ - Xl’ x ’ y ’ d , Y*! x,v,d + vr") 

- g (A) (T - s, - Xl**’*’ 4 , Y*L x ’ y ' d )\ dN~. 


Now, from the expression (4.4) of q^'(t, d, y), we see that: 

dqW . „/ A ).dq^ 

+A(p + g( A )(.,. + <5 + ,. + 7r + ) + p _ g (A i(., • + S ~,. + tt~) - g (A) ) = - 


A7T 

w 


and then: 


dqX = — — ds 

27 


+ 


r(y,/3)cr d 


+ 


(r(ri,P) + v)(T-s) + 2 7 
r(rj, (3)5 + — n + 


dB, - 


00 


-dN+ + 


(r( 7 ,/3) + ^)(T-s) + 2 7 
r(r/, /3)h — 7T 


(r(? 7 ,^) + i/)(T-s) + 2 7 s (r(r ? ,/3) + z/)(T-s) + 2 7 
This proves the required assertions of the proposition. 


dW s 

dN-. (4.8) 

□ 


Remark 4.1 The above supermartingale (or submartingale) property implies in particular 
that the mean of the optimal trading rate process (q s ^)o<s<T is decreasing (or increasing) 
in time, and so that the trajectory of the optimal inventory mean E[Xs’ x,2/,rf ], 0 < s < T, is 
concave (or convex). Moreover, from the martingale property of q s ^' + 0 < s < T, we 

have: E[gi^] = ^ A 1(T, d — x,y) — ^s for 0 < s < T. Fix d, x, y, and let us then denote by 
s( A i := l^gi A i(T, d — x,y), which is explicitly written as: 


T 1 (r(??, P)n + A(r(ry, p)5 - f j)T + r(rj, fd){d - x) - y 

2 + \it 11 ( r (v,P)+’') T 

1 + 2 7 


We have the following cases: 
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< 0 and 7r > 0: this may arise for large y, or d « x, or r(rj, (3)5 « 7t/ 2. In this 

dR[x°™’ d ] _ 


extreme case, 


0,x,y,dj 


- = E[qs ] < 0 for 0 < s < T, i.e. the trajectory of E[A, 

0 < s < T, is decreasing, which means that the agent will “always” sell electricity 
shares since she takes advantage of high price, in order to decrease her inventory for 
approaching the demand, and because in average, the jump size of the demand is 
much lower than the positive jump size of the price. 


s < 0 and 7r < 0: this may arise for small y. or d » x, or r(r), /3)5 » ir/2. In this 


extreme case. 


rfE[A~; 


0,x,y,d-\ 


= E[^ A *] > 0 for 0 < s < T, i.e. the trajectory of E[Xs’ x ’ y,a ] 
0 < s < T, is increasing, which means that the agent will “always” buy electricity 
shares since she takes advantage of low price, in order to increase her inventory for 
approaching the demand, and because in average, the jump size of the price is much 
lower than the jump size of the demand. 


ds 


0 ,x,y,d-i 


• s^ > T and 7r > 0: this may arise for r(r],(3)5 » n/2, d » x or small y. In this 
other extreme case, the trajectory oiK[Xs' x,y ' c \ 0 < s < T, is increasing, which means 
that the agent will “always” buy electricity shares at low price in order to approach 
the residual demand at final time. 


• s( A ) > T and 7r < 0: this may arise for r(r],f3)5 « 7t/2, d « x or large y. The 
trajectory of K[Xg X,V ], 0 < s < T, is decreasing, which means that the agent will 
“always” sell electricity shares at high price in order to approach the residual demand 
at final time. 


0 < s (A) < T: in this regular case, it is interesting to comment on the two subcases: 

— if 7r > 0, the trajectory of s i —> E[Xs’ x ’ y,<i ] is increasing for s < and then 
decreasing for < s < T. This means that the agent starts by purchasing 
electricity shares for taking profit of the positive price jumps (which have more 
impact than the negative price jumps as p + ir + + p~n~ > 0), and then resells 
shares in order to achieve the equilibrium relation (4.7). 

— if 7r < 0, i.e. the negative jumps have more impact than the positive ones: the 
agent starts by selling electricity shares and then purchases shares. 


□ 


4.2 Approximate solution 


We turn back to the original optimal execution problem with the non negativity constraint 
on the final production, and as in Sect ion |3.2| we use the approximate strategy consisting 
in the trading rate q^ derived in (4.4), and of the truncated nonnegative final production: 


ttW*_ 

^-( A )> 0 


= i + (D^ d -Xj 


with given in (4.5). We measure the relevance of this strategy edx 

by estimating the induced error: 

£[ X \t,x,y,d) := “ v (x \t,x,y,d), 
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for (t, x, y,d ) E [0,T] xMxRx M, and also measure the approximation error on the value 
functions: 

£^ X \t,x,y,d) := v W (t,x,y,d) - v W (t,x,y,d). 

Proposition 4.2 For all ( t,x,y,d ) E [0, T] x M x M x M, we have 


0 < < rM>V(T-t)Eh' m 


2/3 




y/V(T — t) 


for i = 1,2, where ip, m, V are defined in Proposition 3.2 


m. (A) (t, d, y) = m(t,d,y + \(^-r(ri,P)6)t) 


(4.10) 


+ A 


t — 


2 7 


rpq, 3)5 — tt 
r(rj, (3) + v V rpq, (3) + v 


In 1 + 


r{r), (3) + v 
27 ' 


and 


_ /■ ^- W T-,) + 27 ) + 7-(T —,) £ 


Jt {r(r],P) + v)(T-s) + 2 7 
Proof. By the same arguments as in Proposition |3.2[ we have 

0< S\ x \t,x,y,d) < £ (x) {t,x,y,d ) := J (A) (t, x, y, d\ q W , ~ x, y, d; g (A) , 4 A) ), 

for * = 1,2, and 


£ {x \t,x,y,d) = 


yr(r},P) 

2/3 


E 


/nM vt,x,y,d\ 2^ 

) ,L D t ^ d -X/i x ’ y ’ c 


<0 


(4.11) 


for ( t,x,y,d ) 6 [0,T] xMxlxl, Now, recall from (4.8) that: 


= -A 

+ 

+ 


7r 
— + 


r(rj, (3)6 — tt 


ds 


0o 


2 7 (r(??,/3)+ i')(T-s)+ 2 7 J 

_ r(y,P)<Td _ dB _ 

(r(r?,/3) + z/)(r —s) + 2 7 s (i-(t/, /3) + i/)(T - s) + 2 7 
r(y,/3)5 + - 7T+ i r(r),P)6~ - tt 


(r(* 7 » 0 ) + i/)(T- s) + 27 


+ 


(r(? 7 ,^) + i/)(T- s) + 2 7 


dW s 

div s -, 


where we write the dynamics directly in terms of the Poisson processes N±. By integration, 
we deduce the (path-dependent) expression of qi X \ t < s < T: 

~(A) = -(A) Att \(r(i 1 ,P)5-n) / (r( 7 ,/3) + v)(T - s) + 2 7 \ 

t 2^y r(n,B) + u V (r(n. 3) + u)(T -1) + 2-y / 


+ 

+ 


/ 


2 7 v ?’(?y,/3) + i/ V(?’( 7 , /?) + i/)(T — t) + 2 7 . 

r(r),P)a d f s a Q 


dB u - 


l 


t (r(rj,P) + u)(T-u) + 2 7 J t {r{y, 3) + v)(T - u) + 2 7 
- 7T+ + r(r), P)S~ — tt~ 

,/3) + i/)(T-u) + 2 7 “ + J t 


(r(y, 


(r(ri,P) + i/)(T-u) + 2 7 
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with q^ = q( x \T — t,d — x, y). We thus obtain the expression of the final spread between 
demand and inventory: 


D t /-X$ B ’ v ’ d = d — x + n(T — t)+ a d dB s + 5 + dN++ 5~dN~ - 


i x) ds 


Jt 

ivS x \T — t,d — x,y) 

rT °d(v(T - s) + 2 y) 


+ 

+ 

+ 


-dB s + 


° o(T - s) 


't ( r ( r ),P) + v)(.T-s) + 2 7 J t ( r (ri,P) + i/)(T-s) + 2 7 

[ T j + Kr-s) + 2 7 ) + 7r+(r- 5 ) 

Jt {r(r], (3) + u)(T — s) + 2 7 
f T 5~ (: v(T — s) + 27 ) + tt~ (T — s) 


dW x 


it 


( r (v,P) + v)(T- s) + 27 


- dN: 


(4.12) 


by Fubini’s theorem, and where 


m^ x \t,d,y) := d + fit — tq^ (t, d, y) + — [ sds 

27 y 0 

A(r( 7 , /3)5 - tt) f* ^ f (r(rj, j3) + v)s + 27 


r{ji,P) + v J 0 \(r(rf,/3) + u)t + 2 7 


ds, 


is explicitly written as in (4.10) after some straightforward calculation. Denoting by A t jf’ y ’ d 


li¬ 
the continuous part of D^ a — X/p x,y,a consisting in the three first terms in the rhs of (4.12), 

and by E^’*, E r ’ f the jump parts consisting in the two last terms of (4.12), so that 

r^t,d y' t,x,y,d a t,x,y,d , ^,+,2 . ,t 

JLs rj-\ u\.rjn - / » rj-t ~J~ 2-Jrjn ~| 2-Jrjn ^ 

we notice that Ay X,y,<i follows a normal distribution law with mean (T —t,d — x, y) and 

variance V(T — t ), independent of Ey’ f . Then, conditionally on E^’*, D!^ d — X^ x ' y ' d follows 
a normal distribution law with mean 7n^ x \T — t,d — x,y) + E^’* + E^’*, and variance 
V(T — t), and this implies from (4.11) that: 


£ {x) (t,x,y,d) = 


< 


iyiv, P) 

2/3 

yr(rj, /3) 
2/3 


V(T-t) E 


V’ 


V(T - t)E 


m( x ^ (T — t, d — x, y) + E ^’ 4 + E T ’ f 
y/V(T — t) 

(T — t, d — x, y) + E, 


— ,t 

T 


\/V{T-t) 


since E^’* > 0 a.s. and i/j is non-increasing. 


□ 


Comments on the approximation error. Let us discuss about the accuracy of the 
upper bound in (4.9): 


£ {X ){T -t,d- x,y) := 


W'idi P) 

2/3 


V(T — t)E ip 


m 


W(T-t,d-x,y) + E 




\JV(T — t) 


First, notice that m/ x \T — t,d — x,y) + E^’ f ~ m(T — t, d — x,y) a.s. in the limiting 
regimes where T — t goes to zero, d — x or y goes to infinity. Therefore, by dominated 
convergence theorem, £( X \T — t, d — x,y) converges to zero in these limiting regimes as in 
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the no jump case. However, we are not able to derive an asymptotic limit as in the no 


jump case of Proposition 3.3 except when = 0, i.e. 5 = ir = 0, fo which we get 


the same asymptotic limit. Actually, in the presence of negative jumps on the demand, it 
is intuitively clear that our approximation should be less accurate than in the no jump case 
since the probability for the residual demand to stay above the final inventory is decreasing. 
Anyway, the explicit strategies {q^\i^’*) still provide a very accurate approximation of 
the optimal strategies at least in these limiting regimes, as illustrated in the next paragraph. 


4.3 Numerical results 

We plot trajectories of s ome re levant quantities that we simulate with the same set of pa¬ 
rameters as in Paragraph 3.3.2 gq = 1/60 6 -(MW ) _1 • s -1 / 2 , 07 = 1000/60 MW-s -1 / 2 , (5 = 
0.002 €-(MW) -2 , r? = 200 €-(MW)“ 2 , p = 0 MW-s" 1 , p = 0.8, v = 4.00 • 1(T 5 6-(MW)- 2 , 
7 = 2.22€-s-(MW)- 2 , T = 24h, A 0 = 0, D 0 = 50000 MW and Y 0 = 50 €/MW)-f More¬ 
over, we fix the probability of positive jumps, p + = 1 (then all jumps are positive: p~ = 0 ), 
and the following values for the jump components: A = 1.5/(3600 • 24) s -1 , 7 r + = 10 
6 /MW)- 1 , 5+ = 1500 MW. 

For such parameter values, we observe two occurrences of jumps on the trajectories of the 
demand of price. Moreover, the probability P[Xt > Dt\ is bounded above by 2.92 x 10 -16 , 
the error £i A )(0, Dq — Xq. Yq) is bounded by 2.66 X 10” 5 6, and 

v W (0,X 0 ,Y 0 ,D 0 ) = 20209506. 

The executed strategy (q^\Crp’*) can then be considered as very close to the optimal 
strategy. This has to be compared with the numerical result obtained in the previous 
section in the no jump case where we obtained a lower expected total cost: D(0, Xq, Yq, Dq) 
= 19167006. 

Figure |d] represents the evolution of the trading rate (<?^)te[o,Tp and we see that it 
is decreasing consistently with the supermartingale property in Proposition 4.1 Actually, 
we observe that the deterministic part in (4.6), which is linear in time, dominates the 
stochastic part. The interpretation of the strategy is the following: since positive price 
jumps are expected, the agent purchases a large number of shares in electricity with the 
hope to sell it later at a higher price thanks to the possible occurrence of a positive jump. 
At the price jump times, which can be visualized in Figure [5] we notice that the control 
q reacts by a decrease in the trading rate. The reaction to the second jump is more 
sensible than to the first jump since it occurs a short time before the final horizon T, where 


the objective is also to achieve the equilibrium relation (|4.7) between price and marginal 

the trajectory of the optimal 
This emphasizes the double 


cost. Finally, we observe clearly in Figure [6] the concavit y o: 
inventory process (JN^) te [o.T 1 )? as expected from Remark 4.1 
objective of the agent: on one hand, the purchase of electricity shares for taking profit of 
the positive price jumps, and on the other hand the resale of electricity shares for attaining 
the equilibrium relation between price and marginal cost at terminal date. We also plot the 
production at the final time T on Figure [6j and observe as in the no jump case that the 
imbalance cost Dt — Xt — £t is positive. 
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Price (euro/MW) Rate of buying/selling (MW - s') 


2.0 



Figure 4: Evolution of the trading rate control q^ 



Figure 5: Simulation of the quoted impacted price Y and of the unaffected price P 



Figure 6: Evolution of the inventory X and of the forecast residual demand D 


Next, we plot trajectories with the same set of parameters, but with p + = 0.3 (i.e. 
p~ = 0.7), 7T _ = —10 €-(MW) _1 , 5~ = —1500 MW. There are, in average, more negative 
than positive jumps. Now 


-,(A) 


(0, Xq, Yq, Dq) = 1756330€. 


Figure | t] shows that the trading rate (Qi^)t&[ 0 .T] is increasing, which is consistent with 
the submartingale property in Proposition 4.1 the deterministic part in (4.6) dominates 


the stochastic part. Since negative jumps are more expected than negative jumps are, the 


27 





















agent first sells a large number of shares in electricity with the hope to buy it later at a lower 
price thanks to the possible occurrence of jumps, that should be mainly negative. Here, 
the control reacts to the negative price jumps by an increase in the trading rate. Finally, 


m 


Figure [ 9 ] 
process, as expected from Remark 
on that figure. 


we observe the convexity of the trajectory of the optimal inventory (^i)ie[o,T) 

We also plot the production a t the final time T 


4.1 



Figure 7: Evolution of the trading rate control 



Figure 8: Simulation of the quoted impacted price Y and of the unaffected price P 



Figure 9: Evolution of the inventory X and of the forecast residual demand D 
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5 Delay in production 


In this section, we consider the more realistic situation when there is delay in the production, 
assumed to be fixed equal to h E [0, T], and we denote by v = Vh the value function to the 


associated optimal execution problem, as defined in (2.9), where we stress the dependence 


in the delay h. Our aim is to show how one can reduce the problem with delay to a suitable 
problem without delay, and then solve it explicitly. We shall consider the problem without 
jumps on demand forecast and price, but the same argument also works for the case with 
jumps. 


5.1 Explicit solution with delay 

For simplicity of presentation, and without loss of generality, we shall focus on the derivation 
of the value function Vh.(t, x, y, d ) for an initial time t = 0, and fixed ( x,y,d ) elxlx M. 
Given a control trading rate q E A, and from pathwise uniqueness for the solution to the 


dynamics (2.1), (2.7), (2.8), we observe that for any £ E L°(J-x-h) : 

x% x + t = x q T 


r T-h,X°’* h +Z 


a.s. 


V 0,y _ v T~h,Y?’\ 0,d _ rX~hyD' T _ h 

I rj-1 - I rji J J_Jrjn - rj-i 


-) 0 ,d 


(5.1) 


a.s. 


To alleviate notations, we shall omit the dependence in the fixed initial conditions (x, y, d), 
and simply write X s = Xg' x , Y s = Y®’ y , D s = Dg’ d , for s > 0, Vh = Vh( 0 ,x,y,d), and 


J(0;g,£) = J(0,x,y,d',q,£) for the the cost functional in (2.10). By the tower property of 


conditional expectations and from (5.1), the cost functional can be written, for all q E A, £ 
E as: 


J(0',q,0 


= E 


> E 


rT-h 


>0 


i (Y s + -yq s )ds + c(f) + J(T - h,X T _ h + £, Y T - h , D T _ h ] q, 0) (5.2) 


rT-h 


q s (W + 7 Qs)ds + c(0 + v NP (T - h, X T _ h + £, Y r _ hl D T _ h ) 


by definition (3.8) of the value function v NP for the optimal execution problem without 


production, i.e. the pure retailer problem. Since q is arbitrary in A, this shows that: 


inf J(0; q,£) 
q&A 


(5.3) 


> inf E 
q&A 


rT-h 


L J 0 


q s (Y s +'yq s )ds + c(^) + v NP (T - h,X T _ h + £,Y T - h , D T -h ) 


for all ^ E L°(J r 'p_/ 1 ). Now, given q E A, and £ E L°(J r j’_/ 1 ), let us consider the trading rate 
q NP, £ in Ax~h solution to the pure retailer problem: v NP (T — h, Xx-h + £, Yx~h, Dx-h), 
hence starting at time T — h from an inventory Xx~h + By considering the process q E 
A dehned by: q s = q s for 0 < s < T — h, and q s = q^ P ’^, for T —h < s <T 1 we then obtain 


from (5.2): 


■7(0;?,$) 

rT-h 


= E 


L J 0 


q s (Y s + 'yq s )ds + c(^) + v NP (T - h,X T -h + £,Y T -h, D T -h) , (5.4) 
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which proves together with (5.3) the equality: 


inf ' J(Q;q,Q 

qeA 


(5.5) 


= inf E 
q&A 


r T-h 


L J 0 


q s (Y s +'yq s )ds + c(£) + v NP (T - h,X T _ h + €,Y T - h ,D T -h) 


for all £ £ L°(X T -h)- Therefore, Vh = inf 9g ^^ gi ° (j? T _ h ) J(0;g,£) can be written as: 


Vh 


inf 

qzA£eL%(r T . 


E 


r T-h 


q s (Y s +'yq s )ds 


+ c(£) + v NP (T — h, Xx-h + £, Tr-h, Dt-h) 


(5.6) 


In other words, the original problem with delay in production is formulated as an opti¬ 
mal execution problem without delay, namely with final horizon T — h, and terminal cost 
function: 


C h (x,y,d,0 ■■= c(£) + v NP (T -h,x + £,y,d). 


Notice from the explicit expression of v NP in Remark 3.1 that this cost function Ch does 
not depend on T, and is in the form: 

C h (x,y,d ,£) = C h (0,y,d-x- £,0) = c(£) +v NP (T-h,0,y,d-x - £). 


The optimization over q and £ in (5.6) is done separately: the production £ £ U\(J-T—h) is 
decided at time T — h, after the choice of the trading rate (q s ) for 0 < s < T — h (leading 
to an inventory Xx~h) ■ and is determined optimally from the optimization a.s. at T — h 
of the terminal cost Ch(Xx-h, Yr—h, -Dr—ft, £)• It is then given in feedback form by £^_ h = 
i h ’ + (D T _ h - X T _ h ,Y T _ h ) where 

i h ' + (d,y) ■■= arg min C h (0, y,d — £, 0) = argmin [c(£) + v NP (T — h, 0, y, d - £)], 

£>o £>o 


hence explicitly given from the expression of v NP in Remark 3.1 by: 


£ ’ (d,y) = £ (d,y) l|k (diW )> 0 , 


i h (d,y ) := 


V 


(vh + 2y )(nh + d) + hy 
r] + /3 L (r(?7, /3) + v)h + 2y 


The problem (5.6) is then rewritten as 

rT-h 


Vh = inf E 
q&A 


L J o 


Qs(Y s + r yq s )ds + C£(D T ~h ~ X T -h, Yx~h ) 


(5.7) 


(5.8) 


where 


C+(d,y) := C h (0,y,d-i h ’+(d),0). 


Notice that when h = 0, we retrieve the expressions in the no delay case: £ 0,+ = £ + in 


(3.1), Cq = C + in (3.3) and vq = v in (3.2). As in the no delay case, there is no explicit 
solution to the HJB equation associated to the stochastic control problem (5.8). We then 
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consider the approximate control problem where we relax the non negativity constraint 
on the production, i.e. Vh = inf q&A,^&L°{J : T _ h ) <7(0;g,£). Therefore by following the same 
arguments as above, the corresponding value function is written as: 


where 


Vh 


f T ~ h 

inf E / q s (Y s + 'yq s )ds + C h (D T -h-X T -h,Y T -h) , 

qeA IJ Q J 


(5.9) 


C h (d,y) := C h (0,y,d-Z h (d),0). 


From the explicit expressions of in (5.7) and v NP in Remark 3.1 it appears after some te¬ 


dious but straightforward calculations that the auxiliary terminal cost function Ch simplifies 
remarkably into: 

Ch{d,y) = vo(T — h, 0, y, d) + K h , 


where vq is the auxiliary value function without delay explicitly obtained in Theorem 3.1 


and I\h is a constant depending only on the delay h and the parameters of the model, given 
explicitly by 

~ 2 al + ajv 2 + 2pcr 0 fT rf ^ 


K h = 


rj 


-h 


2 (y + /3)(r] + v)(r(r], ft) + is) 

, a o + a W - 2p*o<TdV ln A {d + v)h \ 
(n + i') 2 v ‘I'y ) 


a l + T dP) - < 2‘P cr 0 <J d r ( r h P) 


( r (r],P) + v) 2 


27 
In (1 + 


( r (h, P) + v )h 

27 


One easily checks that I\^ = 0 for h = 0, and Kh is increasing with h (actually the derivative 
of Kh w.r.t. h is positive), hence in particular Kh is nonnegative. Plugging into (5.9), we 
then get 


Vh 


= inf E 
q&A 


cT—h 


0 


Qs{y s + lQs)ds + v 0 {T - h, X T -h,Y T -h, D T -h) + K h . (5.10) 


Therefore, by using the dynamic programming principle for the control problem ho = 
vq( 0, x,y,d) in (3.4), we obtain this remarkable relation 

= v 0 + K h , 


Vh 


(5.11) 


which explicitly relates the (approximate) value function with and without delay. As ex¬ 
pected from the very definition of Vh, this relation implies that Vh — ho is nonnegative, and 
is increasing in h. This is consistent with the intuition that when making the production 
choice in advance, we do not take into account the future movements of the price and of 
the residual demand, which should therefore lead to an average positive correction of the 
cost. More precisely, the relation (5.11) gives an explicit quantification of the delay impact 
via the term Kh (which does not depend on the state variables x, y, d) in function of the 
various model parameters. Moreover, the optimal control of the stochastic control problem 
(5.10) over [0 ,T — h) is explicitly given by the optimal control (q s )o<s<T-h of problem ho 
without delay in Theorem |3.1[ 


Let us now consider the following strategy (q h,+ , i^’—h) ^ ^ x ^+(J r T-h) f° r the original 
problem Vh with delay: 
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• Before T — h, follow the trading strategy ql + = q s , s < T — h, corresponding to the 
solution of the auxiliary problem without delay as if production choice is made at time 
T, and leading to an inventory Xt-1i, and an impacted price Yp-h- 

• At time T — h, choose the production quantity: 

ir-h := £ h ’ + ( D T-h ~ Xr-hiYT-h)- 

jyp , * 

• Between time T—h and T, follow the trading strategy q s ’ + = q s ’ T ~ h , T—h < s < T, 
corresponding to the solution of the problem without production, and starting at T — h 
from an inventory X'x-h + £x’-h' 

In order to estimate the quality of this approximate strategy with respect to the optimal 
trading problem vp- measured by 

£? ■= J(0-,q h ’ + ,$* h )-v h , 

we shall compare it with the following strategy (q h ,£x~h) G A x L°(J-x~h) : 

• Before T — h, follow the trading strategy q^ = q s , s < T — h, corresponding to the 
solution of the auxiliary problem without delay as if production choice is made at time 
T, and leading to an inventory Xx-hi and an impacted price Yx—h- 

• At time T — h, choose the “production” quantity (which can be negative): 

ir-h = i h \Y>T~h ~ X T - h ,Y T - h ). 


N P 

• Between time T—h and T, follow the trading strategy q^ = q s ’ T ~ h , T — h < s < T, 
corresponding to the solution of the problem without production, and starting at T—h 
from an inventory Xx-h + £ x-h ■ 


Then, by construction an d fol 


leading to the expression (5.11 
i.e. v h = J(0;q h ,£%,_ h ). On t 
that 


owing the arguments (see in particular (5.4), (5.9), (5.10)) 
) of Vh, we see that (q h ,£x-h) the optimal solution for h/,, 
le other hand, since Vh < Vh < Ax-h)' we deduce 


max(v h - v h , S?) < £ h := J(0; q h ’ + ,^- h ) ~ J (°! Q h ,£x~h)- 

Now, from the expression (5.4) of J, and by same arguments as in the proof of Proposition |3.2| 
(see the derivation of relation (3.15)), we have 


£ h 


Cl ( Dt-h ~ Xt-h, Yr-h) — Ch{Dx~h ~ A 'r-h, Yr-h ) 


v np ( T ~ h, 0, Y T -h, Dr-h - X T -h ~ £x-h) + c (^T-h) 


~h.* 


- V NP (T - h, o, Y T _ h , D T _h - X T -h - ix-h) ~ c (ir-h) 
V r (v,P) ( r (ri:P) + v)h + 2 j (m(T,d- x,y)\ 

(,+^+27 Vhm y xm > 
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where m and if: are defined as in (3.12), and 


Vh{T) = 


crfis 2 + + 2y) 2 + 2pa 0 a d s(us + 2y) 

[(.rfaP) + v)s + 2 ~i ] 2 


ds. 


We recover when h = 0 the expression in Proposition |3.2| of the error in the no delay case, 
and notice that B h decreases when the delay increases: indeed, the error comes from the 
trading procedure before deciding how much to produce, which is dictated by the auxiliary 
problem, in which the final “production” can be negative. After T — h, the followed control 
is optimal, as there remains no production decision at some further date. The shorter the 
period before making the production decision is, the weaker the error is. 

Let us finally discuss some properties of the (approximate) optimal trading strategy 
q h,+ . Recalling from Proposition 3.1 that the optimal trading rate is a martingale in the no 
delay case, we see by construction of (qs’ + )o<s<T that it is a martingale on [0,T — h) and 
a martingale on [T — h,T\. Moreover, for any s G [T — h,T], and 1 G [0 ,T — h), we have 


e[?; 


:/l,H 


= 


K[E[qs P ^ T - h \T T - h ]\T t ] = E [q 1 ^ 


NPtr-h 
h 


Tt 


= E 


r](ph + D T -h - X T -h - iT-h ) - 


= E 


(r/ + v)h + 2y 

rj()j,h + Dx-h — X-x-h ~ £ T-h ) — Yr-h 


(V + v)h + 2y 


- T t 


VTt 


+ 


V 


-E 


th C 11 -,* I 77 

VT-/i S T-h^t 


h,* 


= E 


(?y + u)h + 2y 
r{rj, P)(l J ‘h + Dx-h ~ Xx-h) - Yx-h 


(r{rj, (3) + v)h + 2y 


+ 


V 


{rj + u)h + 2y 
^[qT-h\Xt] + 


E 


ch I tt 

VT-/i ^T-h\ J ~ t 


(rj + v)h + 2y 


E 


- T t 


Zh,* 


= Qt + 


n 


-E 




£r~l 

v T-h~ $ T-h 
'-Tt 1 


Tt 




< Qt = q t ' 


(5.12) 


(rj + v)h + 2y 

where we used the tower rule for conditi onal expectations, the martingale property and the 
explicit expression of q NP, ^T-h i n Remark 3.1 the definition of the martingale property 

and explicit expression of q in Theorem 


3.1 


and finally the fact that £ T ’-h = ^T-h^^ h >0' 

This shows in particular the supermartingale property of q h,+ over the whole period [0, T], 


Notice that the same arguments as for the derivation of (5.12) shows the martingale property 


over the whole period [0, T\ of the optimal trading strategy q h associated to the auxiliary 
problem v^. Moreover, by the martingale property of q h ' + on [0, T — h), and relation (5.12), 
we see that the (approximate) optimal inventory process X h,+ with trading rate q h,+ has 


on average, a growth rate dE ^ B —1. which is piecewise constant, equal to: 


m. 


Qo, 

Qq } := Q0 + ( v +u)h+2'y^i^T-h 1 ^_ h <o\ 


< Qo, 


for 0 < s < T — h 
for T — h < s <T, 
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with n — r ('n^))(l-iT+d-.i-)-y , 

With qo — (r(jj, / 8 )+t/)T +27 ’ alld 


4h) 

Qo = Qo 


Vr(v,P) 


P((r] + v)h + 27) 




m{T,d~x,y) 

s/Wt) 


where 


'ijj(z) := (j)(z) — z&(—z), z E 
is a nonnegative function, as pointed out in (3.17|). 


5.2 Numerical results 


We plot figures showing relevant trajectories with the same parameters as in Section 3.3.2 
We add a delay h = 4 hours: the production choice has to be made four hours before the 
end of the trading period. We have 

v h (0,X 0 ,Y 0 ,D 0 ) = 19254606, 

y higher than the value vq(0, Xq,Yq, Do) = 19167006 without delay. 


which is slight 
On Figure 

and then we go on buying shares on the intraday market in order to go nearer to the demand 


10 


we see that at time T — h, the positive production choice Ct\ ma de 


forecast, with a smaller slope of trading rate. On Figure 11 which represents the control 
process without the last hour of trading (because oscillations then become overwhelming), 
we see that after date T — h, as we do not plan to use final production leverage any more, 
the approximate optimal control process q h,+ oscillates a lot as we are approaching the end 
of trading time. We can compare with Figure [l] to assert that qualitatively, the control 
in the problem with no production oscillates more than the one in the problem with final 
production, as in the former problem, the intraday market is the only way to seek to reach 
the equilibrium. 



Figure 10: Evolution of the inventory X (with production choice at time T —h) and of the forecast 
residual demand D 
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Figure 11: Evolution of the trading rate control q h ’ + without the last hour 


A Appendix 

A.l Proof of Theorem 13.11 

The Hamilton-Jacobi-Bellman (HJB) equation arising from the dynamic programming as¬ 
sociated to the stochastic control problem (3.4) is: 

dv r dv dv dv 1 2 d 2 v 1 2 d 2 v d 2 v 

» + Si + "igy + "Bd + 7°w + T'W + pam aiad + q(v + 7,) 1 = °- 


dt qe R L dx dy dd ' 2 ” u dy 2 ' 2 
y(T,x,y,d) = C(d-x ) = \r{rj, /3){d - x) 2 . 

The argmin in HJB is attained for 


-/ A 1 rdv dv 

c(t, x ,y,d) = + + 


and the HJB equation is rewritten as: 

dv dv 1 2 d 2 v 1 2 9 2 v d 2 

b7 + + b°bbb2 + + P a ° ad 


dt ' ^dd ' 2~"dy 2 ' 2 “ a dd 2 
v(T,x, y, d) = ±r{rj,/3){d- x) 2 . 


1 rdv dv i 
dydd 4'y'-dx^~ l 'dy Jl ~^^ 


(A.l) 


We look for a candidate solution to HJB in the form 

w(t,x,y,d ) = A(T — t)(d — x) 2 + B(T — t)y 2 + F(T — t)(d — x)y 
+ G(T -t)(d-x) + H(T - t)y + K (T - t), 


(A.2) 


for some deterministic functions A, B , F. G, H and K. Plugging the candidate function w 
into equation (A.l), we see that w is solution to the HJB equation iff the following system 
of ordinary differential equations (ODEs) is satisfied by A, B, F, G, H and K: 


A' + ±(-2A + vF) 2 = 0 
B' + ±{2vB-F + 1) 2 = 0 
F' + A(—2A + vF)(2vB — F + 1) = 0 

G'-2nA + ±(-2A + vF)(-G + vH) = 0 
H' -/jlF+±(2vB-F + 1)(-G + vH) = 0 
, K' — — (ctqB + cr^A + pa^a^F) + ^(—G + uH) 2 = 0 
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with the initial conditions A( 0 ) = ^r(rj,/3), B(0) = 0, F( 0) = 0, G( 0) = 0, H( 0) = 0, 
A"(0) = 0. We first solve the Riccati system relative to the triple (A, B, F ), and obtain: 


' A ( + \ - ''fa/ j )(§*+7) 

1 ' (r(r;,/ 3 )+i/)i +27 * 


R(f) = _I_ t _ F ( f \ = _ r(ri,P)t _ 

' 2 (r(?j,/ 3 )+^)t +27 ! 1 > (r(r),P)+v)t+2'i ' 


(A.3) 


Then we solve the first-order linear system of ODE relative to the pair (G, F[), which leads 
to the explicit solution: 


G(t ) = 2ytA(t), and H(t) = —2 r(r], (3)ytB(t). 


(A.4) 


Finally, we explicitly obtain K from the last equation: 

,°0 + a d r ‘ 2 ( r l^) ~ 2pcrocr d r(v,P) 


K(t) = 7 - 


+ 


(r( 7 ?,/ 3 ) + i,) 
o~;H 7 b P) u + 2pcr 0 o-dr(r?, ft) - erg 
2 (?’(??, P) + v) 


In ^1 + 


{r(y,/3) + v)t 
27 


t + 


r(T7,/3)/A 2 (|t + 7 ) 

(r{ri,P) + v)t + 2 7 ' 


(A.5) 


By construction, u) in (A.2) with A, B, F, G, H and K explicitly given by (A.3)-(A.4)- 


(A.5), is a smooth solution with quadratic growth condition to the HJB equation (A.l) 


Moreover, the argmin in HJB equation for w is attained for 


q(t,x,y,d ) = 


1 rdw dw 

"2^fe + ^ +2/] 

r{jh P)(K T ~ t) + d - x) - y 

{r(jh P) + v)(T - t) + 27 


= : q(T — t,d — x, y). 


Notice that q is linear, and Lipschitz in x, y, d, uniformly in time t, and so given an i nitia l 
state (x, y , d ) at time t. there exists a unique solution (X t,x ’ y,d ,Y t,x,y,d , D t,d )t< s <T to ( 2 . 1 )- 
(2.T)-(2.8) with the feedback control q s = q(T — s, — xi' x,y,d ,Ys' x ' v,d ), which satisfies: 


E[sup t<s<r \xl’ x ' y ’ d \ 2 +\Ys’ x ’ y ' d \ 2 +\D t s ’ d \ 2 ] < 00 . This implies in particular that E[J^ |g s | 2 ds] 
< 00 , hence q E At- We now call on a classical verification theorem (see e.g. Theorem 3.5.2 
in 0 ), which shows that w is indeed equal to the value function v. and q is an optimal 
control. Finally, once the optimal trading rate q is determined, the optimal production is 


obtained from the optimization over £ E M of the terminal cost C(D^ d — Xj, x ’ y,d ,^), hence 

□ 


given by: (D^ 1 - X%, x ’ y ’ a ). 


>t,x,y,d\ 


V+P 


A.2 Proof of Theorem 14.11 

The Hamilton-Jacobi-Bellman (HJB) integro-differential equation arising from the dynamic 
programming associated to the stochastic control problem v = v ^ with jumps in the dy¬ 
namics of Y and D is: 


dv r dv^ dv^ dv^ 1 2 d 2 v^ 

+ mf [q^ -1- vq—^. -h ^ + -<t 0 - 

geM L ox dv ad 2 


dt 


dy 


dd 


dy 2 


+ A [p + v^ x \t, x,y + vr + , d + J + ) + p v^ x \t, x,y + n 
v^ x \T,x,y,d) = C(d — x) = \r(y, P)(d — x) 2 . 


1 2 d 2 v^ d 2 v^ 

+ Y‘^ + imad -d^d +q(:u + ' iq '’‘ 

d + 5~) — v^ x \t,x,y,d)\ = 0 
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Notice that with respect to the no jump case, there is in addition a linear integro-differential 
term in the HJB equation (which does not depend on the control), and the argmin is attained 
as in the no jump case for 


q W (t,x,y,d) = 
The HJB equation is then rewritten as 


1 rdv^ dv 

+ u ~d^ + y \ 


dvdv^ 1 1 2 d 2 vH) d 2 1 dv^ 

dt ^ dd 2 <T ° dy 2 + 2° d dd 2 P a o a d dydd 4 y ^ dx V dy 
+ A [p + v( x \t, x, y + 7 r + , d + <5 + ) + p~v^ x \t, x,y + 7r~ ,d + 8 ~) — v^ x \t, x, y, d)] 
vW(T,x,y,d) = ±r{rj,P)(d-x) 2 . 


We look again for a candidate solution to (A. 6 ) in the form 


+ y\ 

= 0 

(A. 6 ) 


w W (t,x,y,d) = A X {T -t)(d- x) 2 + B X (T - t)y 2 + F X {T -t)(d- x)y 

+ G X (T -t)(d-x) + H X (T - t)y + K X {T - t), (A.7) 


for some deterministic func tions A x , B x , F x . G x , H x and K x . Plugging the candidate 
function w^ into equation (A. 6 ), we see that w^ is solution to the HJB equation iff the 
following system of ordinary differential equations (ODEs) is satisfied by A x . B x , F x , G x , 
H x and K x : 


A'x + + vF x ) 2 = 0 

H' x + ^{^B x ~ F x + 1) 2 = 0 

F' x + ^(— 2A X + uF x ){2uB x — F x + 1 ) = 0 

G' x -2iiA x + ±(-2A x + vF x )(-G x + vH x )-\(26A x + TrF x ) = 0 

H x — pF x + H{2uB x — F x + 1)(—G X + vH x ) — \{2-kB x + 8F X ) = 0 

K x — yG x — (erg B x + < x 2 A x + paoadF x ) + ^(— G x + vF[ x y 
-X[(p + (8 + ) 2 +p-(6~) 2 )A x + (p + (vr + ) 2 +p~(it-) 2 )B x 

+(p + 8 + n + + p~8~n~)F x + 5G X + i tH x \ = 0 


with the initial conditions Aa(0) = ^r(p,/3), B x ( 0) = 0, F x ( 0) = 0, Ga(0) = 0, H x (0) = 0, 
K x { 0) = 0. We first solve the Riceati system relative to the triple (A x , B x , F x ), which is 
the same as in the no jump case, and therefore obtain: A x = A , B x = B, F x = F as in 
(A.3). Then we solve the first-order linear system of ODE relative to the pair (G X ,F[ X ), 
which involves the jump parameters A, 7 r and 8, and get: 


Gx(t) 
Hx (t) 


A r(rj, (3)t(irt + 28{vt + 2 q)) 
U 2 (r(rj, /3) + u)t + 2y 
_ A (tt - 2r(r), (3)8)t 2 
U 2 (r(? 7 , fi) + u)t + 27 ’ 
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where G and H are given from the no jump case (A.4). Finally, after some tedious but 
straightforward calculations, we explicitly obtain K\ from the last equation: 


I< x (t) = K(t) + A 7 


P + (tt + — r(rpf3)5 + ) 2 +p (7 r —r(p,/3)5 ) 2 


{r{ri,P) + v)‘ 


In (l + 


(r(-n,P) + v)t 
27 


A p + ((ir + ) 2 — r(r], /3)6 + (2 tt + + v5 + )) + p~((j r - ) 2 — r(ry, /3)5~(2tt~ + l>8~)) 
2 r(p,(3) + v 

Xr(r],/3) 2vp5 + A((p + ) 2 d + ( 7 r + + v5 + ) + (p~) 2 5~ (n~ + v5~)) +2 

~\\~ ~7 / ~ " t 


+ A 2 7 r(? 7 ,/ 3 ) 


r(v, ft) + v 

r(r), /3)5 2 + 2 up + p~5 + 5~ — (( p + ) 2 8 +/ jt + + (p~) 2 5~n~) 
(r(r],/3) + v)((r(ri,/3) + v)t + 2j) 


+ 


2 A 7 r ( 77 , (3) 2 p5 


X 2 n 2 


t z - 


( r ( r h P) + z') ((r(rj, f3) + v)t + 27) 487 

A 2 p + p~r(r/, (3) 2v5 + 5~ + 5 ~tt + + 8 + tt~ 3 
2 (r( 7 ,^) + u)t + 2y 

1 4r( 7 , (3)p,Xn — A 2 7t 2 ^ 3 
8 (r(?7,/3) + v)t + 2 7 


with FT in (A.5). The function u)A) in (A.7) may thus be rewritten as the sum of w in 
(A.2) and another function of t, d — x and y, and is by construction a smooth solution 
with quadratic growth condition to the HJB equation (A. 6 ). Moreover, the argmin in HJB 
equation for u)A) is attained for 


q W (t,x,y,d) = 


1 r chuA) <9rZ)A) 
~^~d^ + V ~d^~ + V ^ 
r(y, P)(v(T - t) + d - x) - y 


+ A 


(rfa P) + v)(T - t) + 2j 
r {p,mT-t) + ^{r{y,f3) + u){T-t) 2 


(r(j],P) + v)(T-t) + 2 7 
= : q W (T -t,d- x,y). 

Again, notice that <7 A) is linear, and Lipschitz in x, y , d. uniformly in time t, and so given an 
initial state (x, y , d) at time t, there exists a unique solution (X t,x,y,d ,Y t,x,y ’ d , D t,d )t< s <T to 
(2.1 )-(4.3)-(4. 1 ) with the feedback control q ^ = q^(T — SjD*/ 1 — x*’ x ’ y ’ d , Yg ,x,y,d ^ which 
satisfies: E[sup t<s<T \ xi ’ x,y ’ d \ 2 + \Ys’ x ’ v,d \ 2 + \D t s ’ d \ 2 ] < 00 , see e.g. Theorem 1.19 in [7]. This 
implies that |gi A ^| 2 ds] < 00 , hence gA) £ _4 t . We now call on a classical verification 

theorem for stochastic control of jump-diffusion processes (see e.g. Theorem 3.1 in 13), 
which shows that w A) is indeed equal to the value function {SA) ; and <7 A) j s an optimal 
control. Finally, once the optimal trading rate <7 A) j s determined, the optimal production is 
obtained from the optimization over £ G M of the terminal cost C(D^ d — X^, x ’ y ’ d ,^), hence 

t, d ^x,y,dy □ 


given by: ^ A) = - X, 
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